• ConnNet: A Long-Range Relation-Aware Pixel-Connectivity Network for Salient Segmentation 

      Kampffmeyer, Michael C.; Dong, Nanqing; Liang, Xiaodan; Zhang, Yujia; Xing, Eric P. (Journal article; Tidsskriftartikkel; Peer reviewed, 2018-12-14)
      Salient segmentation aims to segment out attention-grabbing regions, a critical yet challenging task and the foundation of many high-level computer vision applications. It requires semantic-aware grouping of pixels into salient regions and benefits from the utilization of global multi-scale contexts to achieve good local reasoning. Previous works often address it as two-class segmentation problems ...
    • Deep divergence-based approach to clustering 

      Kampffmeyer, Michael C.; Løkse, Sigurd; Bianchi, Filippo Maria; Livi, Lorenzo; Salberg, Arnt Børre; Jenssen, Robert (Journal article; Tidsskriftartikkel; Peer reviewed, 2019-02-08)
      A promising direction in deep learning research consists in learning representations and simultaneously discovering cluster structure in unlabeled data by optimizing a discriminative loss function. As opposed to supervised deep learning, this line of research is in its infancy, and how to design and optimize suitable loss functions to train deep neural networks for clustering is still an open question. ...
    • The deep kernelized autoencoder 

      Kampffmeyer, Michael C.; Løkse, Sigurd; Bianchi, Filippo Maria; Jenssen, Robert; Livi, Lorenzo (Journal article; Tidsskriftartikkel; Peer reviewed, 2018-07-18)
      Autoencoders learn data representations (codes) in such a way that the input is reproduced at the output of the network. However, it is not always clear what kind of properties of the input data need to be captured by the codes. Kernel machines have experienced great success by operating via inner-products in a theoretically well-defined reproducing kernel Hilbert space, hence capturing topological ...
    • Deep kernelized autoencoders 

      Kampffmeyer, Michael C.; Løkse, Sigurd; Bianchi, Filippo Maria; Jenssen, Robert; Livi, Lorenzo (Peer reviewed; Book; Bokkapittel; Bok; Chapter, 2017-05-19)
      In this paper we introduce the deep kernelized autoencoder, a neural network model that allows an explicit approximation of (i) the mapping from an input space to an arbitrary, user-specified kernel space and (ii) the back-projection from such a kernel space to input space. The proposed method is based on traditional autoencoders and is trained through a new unsupervised loss function. ...
    • Deep Reinforcement Learning for Query-Conditioned Video Summarization 

      Zhang, Yujia; Kampffmeyer, Michael C.; Zhao, Xiaoguang; Tan, Min (Journal article; Tidsskriftartikkel; Peer reviewed, 2019-02-21)
      Query-conditioned video summarization requires to (1) find a diverse set of video shots/frames that are representative for the whole video, and that (2) the selected shots/frames are related to a given query. Thus it can be tailored to different user interests leading to a better personalized summary and differs from the generic video summarization which only focuses on video content. Our work targets ...
    • Density ridge manifold traversal 

      Myhre, Jonas Nordhaug; Kampffmeyer, Michael C.; Jenssen, Robert (Chapter; Bokkapittel, 2017-06-19)
      The density ridge framework for estimating principal curves and surfaces has in a number of recent works been shown to capture manifold structure in data in an intuitive and effective manner. However, to date there exists no efficient way to traverse these manifolds as defined by density ridges. This is unfortunate, as manifold traversal is an important problem for example for shape estimation in ...
    • Dilated temporal relational adversarial network for generic video summarization 

      Zhang, Yujia; Kampffmeyer, Michael C.; Liang, Xiaodan; Zhang, Dingwen; Tan, Min; Xing, Eric P. (Journal article; Tidsskriftartikkel; Peer reviewed, 2019-10-12)
      The large amount of videos popping up every day, make it more and more critical that key information within videos can be extracted and understood in a very short time. Video summarization, the task of finding the smallest subset of frames, which still conveys the whole story of a given video, is thus of great significance to improve efficiency of video understanding. We propose a novel Dilated ...
    • Large-Scale Mapping of Small Roads in Lidar Images Using Deep Convolutional Neural Networks 

      Salberg, Arnt Børre; Trier, Øivind Due; Kampffmeyer, Michael C. (Chapter; Bokkapittel, 2017-05-19)
      Detailed and complete mapping of forest roads is important for the forest industry since they are used for timber transport by trucks with long trailers. This paper proposes a new automatic method for large-scale mapping forest roads from airborne laser scanning data. The method is based on a fully convolutional neural network that performs end-to-end segmentation. To train the network, a large set ...
    • Learning representations of multivariate time series with missing data 

      Bianchi, Filippo Maria; Livi, Lorenzo; Mikalsen, Karl Øyvind; Kampffmeyer, Michael C.; Jenssen, Robert (Journal article; Tidsskriftartikkel; Peer reviewed, 2019-07-19)
      Learning compressed representations of multivariate time series (MTS) facilitates data analysis in the presence of noise and redundant information, and for a large number of variates and time steps. However, classical dimensionality reduction approaches are designed for vectorial data and cannot deal explicitly with missing values. In this work, we propose a novel autoencoder architecture based on ...
    • Reinforced Auto-Zoom Net: Towards Accurate and Fast Breast Cancer Segmentation in Whole-Slide Images 

      Dong, Nanqing; Kampffmeyer, Michael C.; Liang, Xiaodan; Wang, Zeya; Dai, Wei; Xing, Eric P. (Journal article; Tidsskriftartikkel; Peer reviewed, 2018-09-20)
      Convolutional neural networks have led to significant breakthroughs in the domain of medical image analysis. However, the task of breast cancer segmentation in whole-slide images (WSIs) is still underexplored. WSIs are large histopathological images with extremely high resolution. Constrained by the hardware and field of view, using high-magnification patches can slow down the inference process and ...
    • Rethinking knowledge graph propagation for zero-shot learning 

      Kampffmeyer, Michael C.; Chen, Yinbo; Liang, Xiaodan; Wang, Hao; Zhang, Yujia; Xing, Eric P. (Journal article; Tidsskriftartikkel; Peer reviewed, 2019)
      Graph convolutional neural networks have recently shown great potential for the task of zero-shot learning. These models are highly sample efficient as related concepts in the graph structure share statistical strength allowing generalization to new classes when faced with a lack of data. However, multi-layer architectures, which are required to propagate knowledge to distant nodes in the graph, ...
    • Temporal overdrive recurrent neural network 

      Bianchi, Filippo Maria; Kampffmeyer, Michael C.; Maiorino, Enrico; Jenssen, Robert (Chapter; Bokkapittel, 2017-07-03)
      In this work we present a novel recurrent neural network architecture designed to model systems characterized by multiple characteristic timescales in their dynamics. The proposed network is composed by several recurrent groups of neurons that are trained to separately adapt to each timescale, in order to improve the system identification process. We test our framework on time series prediction tasks ...
    • Uncertainty and interpretability in convolutional neural networks for semantic segmentation of colorectal polyps 

      Wickstrøm, Kristoffer Knutsen; Kampffmeyer, Michael C.; Jenssen, Robert (Journal article; Tidsskriftartikkel; Peer reviewed, 2019-11-20)
      Colorectal polyps are known to be potential precursors to colorectal cancer, which is one of the leading causes of cancer-related deaths on a global scale. Early detection and prevention of colorectal cancer is primarily enabled through manual screenings, where the intestines of a patient is visually examined. Such a procedure can be challenging and exhausting for the person performing the screening. ...
    • Uncertainty modeling and interpretability in convolutional neural networks for polyp segmentation 

      Wickstrøm, Kristoffer Knutsen; Kampffmeyer, Michael C.; Jenssen, Robert (Journal article; Tidsskriftartikkel; Peer reviewed, 2018-11-01)
      Convolutional Neural Networks (CNNs) are propelling advances in a range of different computer vision tasks such as object detection and object segmentation. Their success has motivated research in applications of such models for medical image analysis. If CNN-based models are to be helpful in a medical context, they need to be precise, interpretable, and uncertainty in predictions must be well ...
    • Unsupervised domain adaptation for automatic estimation of cardiothoracic ratio 

      Dong, Nanqing; Kampffmeyer, Michael C.; Liang, Xiaodan; Wang, Zeya; Dai, Wei; Xing, Eric P. (Journal article; Tidsskriftartikkel; Peer reviewed, 2018-09-26)
      The cardiothoracic ratio (CTR), a clinical metric of heart size in chest X-rays (CXRs), is a key indicator of cardiomegaly. Manual measurement of CTR is time-consuming and can be affected by human subjectivity, making it desirable to design computer-aided systems that assist clinicians in the diagnosis process. Automatic CTR estimation through chest organ segmentation, however, requires large amounts ...
    • Urban land cover classification with missing data modalities using deep convolutional neural networks 

      Kampffmeyer, Michael C.; Salberg, Arnt Børre; Jenssen, Robert (Journal article; Tidsskriftartikkel; Peer reviewed, 2018-06-14)
      Automatic urban land cover classification is a fundamental problem in remote sensing, e.g., for environmental monitoring. The problem is highly challenging, as classes generally have high interclass and low intraclass variances. Techniques to improve urban land cover classification performance in remote sensing include fusion of data from different sensors with different data modalities. However, ...